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Abstract 

We identify different helicity degrees of freedom of Fierz-Paulian massive gravity 
around generic backgrounds. We show that the two-parameter family proposed 
by de Rham, Gabadadze, and ToUey always propagates five degrees of freedom 
and therefore is free from the Boulware-Deser ghost. The analysis has a num- 
ber of byproducts, among which (a) it shows how the original decoupling limit 
construction ensures ghost freedom of the full theory, (b) it reveals an enhanced 
symmetry of the theory around linearized backgrounds, and (c) it allows us to 
give an algorithm for finding dispersion relations. The proof naturally extends to 
generalizations of the theory with a reference metric different from Minkowski. 



1 Introduction and Summary 



The unique Lorentz-invariant and ghost-free hnearized theory of massive gravity was first 
formulated by Fierz and Pauh [1]. As such, the theory propagates the right number of five 
degrees of freedom, pertinent to a massive spin-2 particle, when considered sufficiently close 
to Minkowski background. However, through an analysis of constraints, Boulware and Deser 
showed that an extra negative-energy degree of freedom arises at nonlinear level [2] (the 
so-called Boulware-Deser ghost). 

The analysis of Boulware and Deser, while being generically applicable to Fierz-Paulian 
massive gravity — the set of nonlinear completions of the Fierz-Pauli mass term (FP), over- 
looked the possibility of a clever choice of the potential at higher orders so as to avoid 
the sixth degree of freedom. Using a more tractable description of massive gravity based 
on Stiieckelberg fields, proposed by Arkani-Hamed, Georgi, and Schwartz (AGS) [3], de 
Rham and Gabadadze made a promising step in that direction [1] . They constructed a two- 
parameter subfamily of FP that was free from the Boulware-Deser ghost in a special, yet 
nontrivial limit. 

The theory, first formulated as an infinite series of higher order corrections to quadratic 
Fierz-Pauli action, was later nicely resummed by de Rham, Gabadadze, and ToUey [5]. 
This made possible an extensive investigation of different classical solutions of the theory 
and comparison with General Relativity (GR)Q We henceforth refer to this two-parameter 
subfamily of FP as dRGT. 

dRGT certainly has many good features compared to generic FP. First of all, and by 
construction, it is free from the Boulware-Deser ghost in the decoupling limit provided that 
there is no background vector field [1]. Unlike generic FP, in which the asymptotically fiat 
spherically symmetric solution is not unique (essentially because the order of equations of 
motion is higher than 2) [T7], dRGT has a unique solution [S]. Moreover, the homogeneous 
and isotropic solution is stable (actually static) [TU], in contrast to the unstable solutions in 
generic FP [18]. These all suggest that the theory should be completely free from the sixth 
degree of freedom. 

An elegant proof of this was first found by Hassan and Rosen [12] . Using a special repre- 
sentation of dRGT [2D] , they showed, in the canonical formalism, that after a redefinition of 
the shift vector the lapse remains a Lagrange multiplier and provides necessary constraints 
to remove the Boulware-Deser ghost (for complementary discussions see [21] and [22]). The 
proof was later extended to generalizations of dRGT with an arbitrary, instead of Minkowski, 
reference metric [23] and bigravity models |24jPI 

In this paper to prove the absence of the sixth degree of freedom we take another approach 
which is based on the following notion. Although the fiuctuations of FP close to Minkowski 
background are stable, the way in which Boulware and Deser's non-linear ghost manifests 

iPor some examples see [Q [71 il El [IHl HH [El [13l [H [15] . For a review see [16] . 

^The phenomenology of the resulting bigravity models have been considered in [25 | (26 1 (27 1 128 j . 



1 



itself is that the spectrum of fluctuations around other backgrounds generically contains 
an extra ghost-like mode. Therefore, if one shows that in a particular member of FP the 
spectrum of perturbations around all backgrounds contains five degrees of freedom, that 
theory is free from the Boulware-Deser ghost. Obviously if the analysis can be carried out 
only in a continuous region of the configuration space, as is the case here (see Sec. 13.41) . the 
conclusion holds only in that region. So, our strategy would be to identify different helicity 
degrees of freedom around generic backgrounds and show that in dRGT there are (almost) 
always five modes. The proof easily generalizes to theories with arbitrary reference metrics. 

To perform the analysis we use the fact that any Lorentz-invariant theory of massive 
gravity can be formulated as GR plus a theory of four scalar fields 0"^, ^ = 0, 1, 2, 3, with an 
internal Lorentz symmetry. The essence of the proof is to align the field-space basis of the 
scalars 0"^, with the coordinate basis of the given background. As long as this alignment is 
possible one of the scalar fields can be shown to be nondynamical in dRGT, while generically 
all of them are dynamical in FP. We will see that the alignment is possible unless the unitary 
gauge metric is too far from Minkowski. 

It is worth noting that the above scalar-tensor formulation of dRGT greatly simplifies 
in two space-time dimensions and the full nonlinear theory can be shown to propagate 
zero degrees of freedom [2^ [5U] . We were unable to generalize those methods to higher 
dimensional versions; instead, we concentrated on the behavior of the theory expanded 
around different backgrounds. 

As a byproduct, our approach illuminates a number of (so far) less explored aspects of 
FP in general, and dRGT in particular: 

• It allows us to understand when the rather controversial Stiieckelberg field that was 
introduced by AGS to describe the helicity-0 mode fails to do so. Our formalism 
provides the alternative definition that is always applicable (let us call this field it). 

• It becomes clear how the original criterion imposed by de Rham and Gabadadze, 
namely, the absence of self- interactions of it, directly results in the absence of the 
Boulware-Deser ghost in the full theory. 

• It enables us to give a practical prescription to obtain dispersion relations of vector 
and scalar modes around generic backgrounds. 

• It follows from our analysis that in dRGT, if one freezes the metric to be fiat, the afore- 
mentioned theory of four scalar fields becomes degenerate on linearized backgrounds - 
it propagates only two degrees of freedom. Therefore, mixing with gravity is necessary 
to give dynamics to the third mode and the cutoff of the theory becomes of order 
A3 = (M,,m2)V3. 

• Once nonlinearities with a length scale / are considered, for / <C gravity can be 
decoupled and the cutoff increases to A; = {Mpim/iy/^ . 
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We emphasize that despite the absence of the sixth mode in dRGT, the two vector and 
one scalar degrees of freedom may become superluminal [3T| [32] or ghosthke [331 [13] around 
some backgrounds. We hope our approach assists similar analyses around a larger class of 
solutions. 

The organization of the paper is as follows. In Sec. 121 we review FP and dRGT with an 
emphasis on their relevant properties for the rest of the discussion. In Sec. [3l we study the 
behavior of the theory around the important class of linearized backgrounds. The unique 
feature of dRGT will become evident at this level. In Sec. HI we extend the analysis to 
nonlinear backgrounds to complete the proof of ghost freedom. And finally, in Sec. [5] 
we provide a recipe to find dispersion relations of vector and scalar modes. Given the 
contribution of nonlinearities to the dynamics of the scalar mode, we determine the upgraded 
cutoff of the theory. 



2 Preliminaries 

The following subsections are meant to provide a concise but relatively self-contained review 
of (a) Fierz-Pauli massive gravity from a modern point of view, and (b) the built-in features 
of de Rham-Gabadadze-ToUey theory. For the most part, we follow discussions of [3] and 
[1] , with the addition of a few complementary comments, calculations, and corollaries. 



2.1 Massive Gravity as a Theory of Four Scalar Fields 

Let us start with the Fierz-Pauli mass term for gravity 

-\Mim'J d'x{hl-e), (1) 

where m is the graviton mass and hf^^, = g^^ — rj^y is the deviation of the metric from 
Minkowski. The contraction of indices can be done using either rj^'^ or g^'^ since the theory 
is specified only up to quadratic order in h^^. 

The action ([T]) appears to break reparametrization invariance of GR. However the invari- 
ance can be easily restored by introducing four scalar fields 0"^, A = 0, 1, 2, 3, also known as 
Stiieckelberg fields |3] (see also [M])- Defining the covariant tensor 

H^,u = 9i,u - VABdf,(j)'^d^(j)^ , (2) 

a reparametrization invariant version of ([T]) can be written as 



j d\^g^^g'"'{H^pH,, - H,,H,^) . (3) 



One can now fix the unitary gauge by choosing = (p^ to recover the original action (JT]). 
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Given the ambiguity of the Fierz-Pauh action beyond quadratic order it is natural to 
consider adding higher order corrections in H^^, to ([3]) to form a family of Fierz-Paulian 
theories of massive gravity, which we denote in short by FP. The action of the most general 
FP can then be written as 

S = Seh + Sfp, (4) 
Seh = Mil f d^x^R, (5) 



S 



FP 
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I d'x^ V{Hl^) , (6) 



where V depends only on the invariants made out of = g^'^H^y, and, upon expansion 
around = 0, it reproduces (j3]) at leading order. 

Recalling the definition of H^^, in ([2]), we see that a function of invariants made of Hj^ 
effectively depends only on the invariants made out of the tensor g^'^da'p^dycjy^riAB- So we 
can write 

V = V{g^''d„<P^d,<l)''vAB) , (7) 

which makes it manifest that any FP is equivalent to GR plus a theory of four scalar fields 
(j)^. This is in fact valid for all theories of massive gravity; see [35] for non-FP examples. 
These scalar fields are noncanonical (have dimension of length), and are derivatively coupled 
(hence have a shift symmetry). In addition, the action remains invariant under a global 
internal Lorentz transformation 

0^ ^ A^</>^, (8) 
which is necessary to ensure the Lorentz invariance of the theory in the Higgsed phase 

Thus we see that to covariantly define FP, four new scalar fields have been added to GR. 
Since the theory is now generally covariant the metric is expected to have only two tensor 
degrees of freedom. One therefore naively expects to get a theory with a total of six degrees 
of freedom, exceeding what is needed for a massive spin-2 graviton by one. 

However, as is well known, the special structure of the Fierz-Pauli mass term is such that 
this extra degree of freedom is absent in the linearized theory. To see this in the Stiieckelberg 
formulation and also to set up a basis for the rest of the paper, we first introduce some useful 
notations and comment on them. 



2.2 Some Notations 

Instead of fixing the unitary gauge let us write (j)^, after changing its index to Greek, as 

= x^" - A^" , (9) 
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and g^i, as 



gfiu = ViMu + h^y, (10) 

so that ([2]) becomes 

H^y = h^y + d^A, + d^.A^-d^A''dr.A^, (11) 

where A^ = rj^^A'^ . We can then formally expand the action ^ in terms of A^^ and h^^, and 
the resulting action can be viewed as an interacting theory of a tensor h^y, and a seemingly 
vector A^ on flat space. To avoid carrying along the volume integrals we use Lagrangian 
density and decompose it into three pieces 

- \y/^V{Hl^) =Ch + ChA + i:-A , (12) 

a potential for h^^, couplings between hfj^^ and A^, and a pure A^ Lagrangian. We frequently 
refer to Ca as the pure scalar sector as it describes the fluctuations of the scalar fields on 
a frozen flat metric, hf^^ = 0. 

Note that in f ll2p A^^ always appears with one derivative acting on it and h^^y without 
derivative. All indices are raised and lowered by the Minkowski metric, and for the sake of 
simplicity we sacrificed the mass dimension so that Ch, '^hA, and Ca are dimensionless. 

Note also that the notational twist from Latin to Greek indices in (j9]) does not, by 
itself, make A^^ a vector field, rather, the symmetries of the effective action around a given 
background determine how the fields regroup into representations of the Poincare algebra. 
Around Minkowski background, where 

h% = {Ky)=0 and A^^ = (AO = 0, (13) 

a space-time Lorentz transformation x'^ = A'^x'^ , if followed by the same Lorentz transforma- 
tion in the field space 0'^ = A^ip^ ■, gives an identical action for the Lorentz transformed field 
A'^ = K'^A" . Therefore, we can call A^^ a vector field-the perturbations of the four scalar 
fields form the components of a vector field around Minkowski space. As will be thoroughly 
discussed, this is not necessarily the case around generic backgrounds and the use of Greek 
indices is just for convenience since we are expanding g^j_y around the Minkowski metric, the 
same metric as tjab which contracts internal indices. 



2.3 Counting Degrees of Freedom of FP Close to Minkowski 

Now we can return to the analysis of Fierz and Pauli's special construct. If we look at 
the quadratic Lagrangian of A^ and /i^^, that is, if we consider small fiuctuations around 
g^lu = Vfj.u and 0^ = x'^, it is unambiguously dictated by the Fierz-Pauli term The pure 
scalar sector looks like 

= ~{2d^Ayd^Ay + 2d^AydyA^ - Ad^A^dyAy) = -^F^, + total derivative. (14) 
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This is the Maxwell theory that propagates two transverse vector degrees of freedom with 
helicity 1. Therefore in the pure scalar sector the perturbations of the four scalar fields 
acquired an accidental U{1) gauge symmetry, making two of them nondynamical. 

Of course Ca is not the whole story and what provides the dynamics of the fifth, helicity-0 
degree of freedom is the mixing of A^^ with gravity: 

4a = -Ih'^'id.A, + duA, - 27],,d„A'^) . (15) 

This term breaks the U{1) gauge symmetry and makes another component of A^ dynamical. 

To show this, we slightly modify Boulware and Deser's analysis of quadratic FP action. 
In the ADM formalism [SB], the quadratic GR action can be put in the canonical form in 
which the perturbations of lapse and shift, /iqo and ha, are Lagrange multipliers: They do 
not carry any time derivative, which makes them nondynamical, and they appear linearly 
in the action. Thus the equations of motion for /iqo and /loi remove four out of six a priori 
degrees of freedom in hij to give two transverse-traceless tensor modes. 

After the addition of the covariant Fierz-Pauli term ([3]), the role of /iqo and /loi as Lagrange 
multipliers is preserved because of the general covariance of the theory. Moreover, the zeroth 
component of becomes nondynamical. This is because the only place that the time 
derivative of Aq appears in quadratic FP is in C^f^\, where from f[T^ we have 

- huAo , (16) 

with a summation over repeated indices. This can be integrated by parts to give a time 
derivative of an already dynamical field hij and make Aq nondynamical. Therefore, only 
three degrees of freedom are added to GR. 

Therefore, our general strategy to count degrees of freedom around a given background 
will be to first look at the pure scalar sector Ca and see how many components of A^ 

(2) 

are dynamical there. If C\ was degenerate, that is, if one or two components remained 
nondynamical, then we investigate generally breaks the accidental U{1) gauge 

(2) 

symmetry that may arise in . However as was the case here, upon integration by parts 
it can conceivably leave one of the components of nondynamical without making /iqo and 
hoi dynamical. If so, we are ensured by the general covariance of the theory to have five 
degrees of freedom; h^Q and hoi remove four out of nine a priori degrees of freedom coming 
from six components of hij and three components of A^. 

Let us confirm the above argument by an explicit calculation in the case of quadratic FP 
action which after separating time and space indices becomes 



l-F^i - -X. - hQ^d^Ai + hoi{A, + diAo) 



-h,,{d,A, - 5i,{-Ao + d,A,)) - -{hi - hi + 2h^ohii - 2hl) 



(17) 
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It is seen that unlike /iqo, hot is not manifestly a Lagrange multiplier. However, it becomes 
manifest once the nondynamical field Aq is integrated out. Using the reparametrization 
invariance of the theory to fix the gauge 

ha = 0, dAj = , (18) 

and also decomposing hoi and Ai into transverse and longitudinal parts 

hoi = hl + d,ij, (19) 
Ai = Aj + diip , (20) 

with dih^i = and diAj = 0, we get from the equation of motion for Aq that 

Ao = if-^. (21) 

Substituting this expression for Aq back in ( |T71) yields 



= M^im^ I (fx 



^Af + hlAj + hil- - Az^vi + ■ ■ ■ 



(22) 



where dots denote terms without 9o, /i^j, or ip. Now we can transform to the canonical 
formulation. Designating the conjugate momenta to Aj and respectively by tiJ and vr^, 
the action in the canonical form becomes 



(23) 



As expected hoi appears linearly and remains a Lagrange multiplier. On the other hand, (f, 
which is the longitudinal component of A^, has now become dynamical through mixing with 
gravity. 

Thus, the Fierz-Pauli term is a special combination that eliminates one of the six naively 
expected degrees of freedom at the linearized level. As Boulware and Deser showed, this 
in not generically true at nonlinear level-the sixth degree of freedom reappears and leads 
to a ghost instability. Therefore, although the fluctuations are healthy around Minkowski 
space (or more precisely when g^^ = rj^^, and 0^ = x^), they are not so around other 
macroscopically different backgrounds. This is of course not tolerable in a theory of gravity. 

One may wonder if there exist constructions similar to that of Fierz and Pauli but at 
higher orders such that the sixth degree of freedom is completely eliminated. And indeed 
a two-parameter family of candidates (dRGT) was proposed by de Rham, Gabadadze, and 
ToUey based on an analysis in the decoupling limit, using a new Stiieckelberg field. 



2.4 A New Stiieckelberg Field 

We cannot do justice in explaining the concept of decoupling limit and its fruitfulness; the 
interested reader is referred to thorough discussions in |31ll]. Here, and in the next subsection. 
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we instead point out some relevant aspects and consequences of the decoupling limit analysis 
that led to dRGT. 

Motivated by the fact that the longitudinal degree of freedom of a massive non-Abelian 
gauge field is the most strongly coupled component, Arkani-Hamed, Georgi, and Schwartz 
(AGS) introduced a new Stiieckelberg field by decomposing as 

A^ = V^ + d^n , (24) 

so that the theory is furnished with a f/(l) gauge symmetry 

Vf,^Vf, + d^a, -K ^-K-a. (25) 

In addition, since each field comes with one derivative, each tt carries two and the action 
acquires a Galilean symmetry tt — )■ vr + h^x^ + c. 

This new Stiieckelberg field, which is supposed to describe the longitudinal scalar mode 
of massive gravity at high energies, is not as well-defined as the original scalar fields which 
were introduced in Sec. 12.11 because the index in is a field-space index not a space-time 
one. However, as we argued above, around Minkowski background the fluctuations of the 
four scalar fields indeed behave as components of a vector field. As long as we perform 
simultaneous Lorentz transformation in the space-time and field space, the two indices are 
indistinguishable. So it is legitimate to introduce vr as in (l24l) to describe the longitudinal 
component. On the other hand on a generic background the meaning of vr can become very 
obscure. 

In spite of this drawback the decomposition ( l24l) has been enormously fruitful in the 
study of massive gravity and greatly facilitates introduction and identification of the special 
features of dRGT. Later in Sec. [3]we will show when this new Stiieckelberg field does describe 
the longitudinal mode of graviton. 

Substituting (12^ in the quadratic Lagrangian, one observes from ( iMl) that tt drops out 
of C\ up to total derivative terms because of its accidental U{1) gauge symmetry, but £^ 
gives a kinetic mixing between h^y and tt 

45 = -/i^^(9^5.-ry^.n)7r. (26) 

After integration by parts this is proportional to 7rR^^\ where R^^'^ is the linearized Ricci 
scalar, and can be diagonalized by a shift of h^^^, 

h^u + mV?]^^ . (27) 

As a result we get a mass term for vr, mass mixing between tt and /i^i/, kinetic mixing between 
and TT, and most importantly a pure kinetic term for tt which, after restoring mass scales, 
looks like 

4'^ = ^M>Vnvr, (28) 
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and is the dominant piece at high momenta p ^ m. In fact, after canonically normahzing 
fields using the Einstein-Hilbert action (|5]), Eq. ( !T4|) (with the overall M^^m^ restored), and 
(I2S]), we get 

h% = Mpih^,, V,^ = MpimV , 7i, = Mpim^Ti, (29) 
and the high-energy, m/p — 0, limit of the quadratic theory around Minkowski becomes 

£(2) ^ hA - \fIAVc) + ^^cDvre . (30) 

The first term is the quadratic Einstein-Hilbert action describing two transverse-traceless 
tensor modes; the second is the square of the field strength of a f/(l) gauge field V^, describ- 
ing two transverse helicity-1 vectors, and the last term describes the scalar helicity-0 mode 
TTc, which is gauge invariant in this limit. 

As anticipated, the self-interactions of vr become the most strongly coupled interactions; 
that is, after canonically normalizing the fields, these self-interactions are suppressed by 
lowest mass scales (cutoffs). For instance, the action (|3]) gives rise to interactions of the form 

Ai ' Al ' ^^'^ 

where A„ = {Mpim"~^Y^'^. The decoupling limit is a limit in which one keeps only the 
dominant interaction(s). 

It was suggested by AGS and is very natural to remove these pure vr self-interactions 
by adding higher order terms to ([3]). For instance, the gauge invariant form of in 
and, consequently, the Fierz-Pauli structure are uniquely determined once we impose 
the requirement that the pure vr terms in the quadratic Lagrangian cancel up to a total 
derivative. Now, if there are higher order pure vr interactions there is always one or a few 
of them which dominate all nonlinearities. Around a macroscopic background they yield 
quadratic terms in vr which suggest the absence of the accidental gauge symmetry when Ca 
is expanded around that background [37t 138]. 



2.5 dRGT 

Quite recently, de Rham, Gabadadze, and ToUey found this nonlinear completion of the 
Fierz-Pauli mass term ([3]). The theory is an infinite series in terms of H^^, since as a 
consequence of general covariance if^i, is nonlinearly related to ir. For instance, to cancel 
cubic terms in ( 13T|) one introduces 0{H^^^) terms which then give rise to sixth order terms 
in vr, and so on. As becomes clear soon, the dRGT theory has only two free parameters, one 
at cubic and the other at quartic order. 

dRGT contains couplings between tt, V^, and /i^j,, such as 

h{dd7r)\ dVdVddTT,--- (32) 
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which, after canonical normahzation around Minkowski, are suppressed at least by powers of 
A3 = (Mpim^Y^^ (hence the terminology A3 theories). However, all pure vr self-interactions 
add up to total derivatives. The total derivative self-interactions of vr with two derivatives 
per field are closely related to Galilean interactions, and it was shown in [35] that in d 
dimension there is a unique such term at each order up to dth order. Beyond dth order these 
total derivatives identically vanish. 

This means that, barring the tadpole (the cosmological constant), after fixing the coeffi- 
cient of the quadratic term which determines the mass of graviton, one is free to choose d — 2 
coefficients for the d — 2 higher order total derivatives. Beyond that, it self-interactions must 
exactly cancel (without integration by parts) and there is no freedom. Since each nth order 
invariant monomial made out of Hj^ has a unique structure of pure vr terms, which spans 
from nth to 2?7.th order in vr, after fixing the coefficient of the total derivatives the theory is 
fully specified. Hence, the two parameter family of dRGT in 4d. 

The fact that self-interactions of vr beyond quartic order must identically vanish, imme- 
diately implies that if in dRGT we replace all fields in Ca with 9^7r, we must get zero 
beyond the quartic order. Therefore at those orders terms can be rearranged so that they 
contain at least one F^^^, = d^Aj^ — dyA^. 

However, writing d^^Ay in terms of the symmetric and antisymmetric part 

d^Ay = {F^y + S^.y)/2, (SS) 

one realizes that the antisymmetry of F^y forces us to have either zero or at least two F^yS 
in each term. So each term beyond quartic order must contain at least two F^yS. 

What about the lower order terms in Well, we have already seen the example of the 
quadratic Lagrangian in (IT^ -up to total derivatives it also contains two F^y^. The cubic 
Lagrangian is still easy to check explicitly. The only cubic self-interaction of vr that is a total 
derivative is 

{Uixf - mix^d^dynf + 2{d^dyTif , (34) 

(3) 

which means that apart from terms that contain two F^y^s, the rest of the C\ should 
combine into a multiple of 

^^J.^l ~ "iSaaS^y + 2S^y , (35) 
to yield ( l34l) upon substitution A^ = d^Ti. However ( l35l) can be integrated by parts to give 
12d^AyF^„Fya^ — Qda-Aa-F^y + total derivatives. (36) 

In Appendix |X] we systematically prove that all combinations of the form 0351) . which 
yield a total derivative upon A^ = 9^vr replacement, reduce to terms with at least two -F^^'s 
after integration by parts. So fortunately we do not need to check explicitly. 
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To summarize, the pure scalar sector La in dRGT can be written schematically as 

La = -^FF[1 + M H ] + total derivatives. (37) 

Therefore, dRGT can be uniquely defined as a subclass of FP in which La contains at least 
two -F^i^'s in each term, up to total derivatives. 

Another property of dRGT which is again a direct consequence of removing pure tt 
interactions is the special form of the coupling between /i^,^ and ir [U |33] 

3 

Ch. = Ku «n^r(n) + o{hl,) , (38) 

n=l 

where 11^;^ = dfj^d^n and the three X^^(n) are given by 

X^^iU) = e'''''''e''^'''U^pU,My5, (39) 

[see Appendix [B] for a derivation of (!38|) and (139|) ]. Note that aihfj^i^X!^'^ {U) is dictated by FP 
and must be equal to (l26l) . so ai = —1/2. The other two coefficients 02,03 are determined 
by the only two free parameters of the theory-the coefficients of the total derivative terms. 

As before, this form of Lkn in dRGT is very constraining, now for the scalar-metric 
coupling sector Lha- In order for Lha to yield ( 138|) upon decomposition = + d^ir, it 
must contain (138|) with the replacement 11^,^ — )■ S^i,/2. Also, the fact that there is no Xl^^ijl) 
beyond = 3 means that terms in Lha which contain one h^^ and more than three A^'s 
identically vanish after = O^tt substitution. Therefore these terms can be rearranged 
such that each of them contain at least one F^y. 

So LhA in dRGT is dictated to be of the form 

3 

Cha = V E "n^ri-^M^/S) + hF{dA +■■■) + 0{hl^) , (40) 

n=l 

where the second type terms are written schematically!! We emphasize that after fixing the 
two free coefficients of dRGT, La and L^a are completely determined but we do not need 
to know more than what is presented in (13 7p and (HQ]) . 

We next study FP (and dRGT) around different backgrounds. 



3 Linearized Background 

To understand the nature and properties of different degrees of freedom on a given back- 
ground, one first needs to look at the quadratic action for fiuctuations around it, as we did 

■^I am thankful to Lasha Bcrczhiani for pointing out an error in the original formula. 
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in Sec. 12.11 for Fierz-Pauli theory around Minkowski. To this end one can usually linearize 
the background since locally (for ultraviolet momenta) curvature effects are often negligible. 
We follow this route in this section but discover that, similar to the case of FP around 
Minkowski, the pure scalar sector of dRGT is degenerate around the linearized backgrounds. 
There is an accidental gauge symmetry which will be broken only through dynamical mixing 
with gravity. 

Consider a generic background of an FP theory ([6]), given by g"^^ in the unitary gauge 
{(f)^ = x^). Since the theory is generally covariant, to study the fluctuations around a given 
point P we can perform a coordinate transformation to a local inertial frame of P. Taking 
P as the origin of the new coordinate system the new background metric is given by 

9% = V,.u + h% = 7]^, - ^{R^au^ + + 0{xxx) , (41) 

and the background scalar fields (p^^ which are the unitary gauge coordinates can be Taylor 
expanded as 

€g = <i^) = <(0) + e^X + 0{xx) , (42) 

where the constant term x!^{0) will be ignored in what follows since 0^ is derivatively coupled. 
In terms of and the new metric (jH]) the unitary gauge inverse metric at point P can be 
written as 

g^P = g'^^d^x^^dpxX^^ = 9''^d^r,M^,,l=o = • (43) 

Thus the matrix appearing in P2l) is just the vielbein at point P. 

We now switch to the notation A'^ so that we can use the results of the last section and 
denote the fluctuations around the background by 

where 

K = - O^'^ + ^(^^) • (45) 

In this section we ignore h^^^, [which is 0{xx)] and nonlinear corrections to Abg to obtain 
the curvature-independent contributions to the quadratic Lagrangian for fluctuations of 
scalar fields and metric /i^j,. Taking h'jf^^ = has two consequences: Firstly, C^a \ the part 
of C^'^'^ which is quadratic in a^, receives contributions solely from Ca [see f|T2l) for definition]. 
Secondly, is sourced by terms in ChA which are linear in /i^j,. 

(2) 

We first concentrate on and distinguish two cases as follows. 
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3.1 Symmetric Background 



When dfj^A^jf is symmetric (or equivalently e^^ = r/^o-e^ = e^^ ) dRGT behaves in an inter- 
esting way because of its special form of Ca ( 1371) . The two common factors of F^y vanish 
when evaluated on the symmetric background, so they must be evaluated on perturbations 
and we get 

4'^=5'"^"^/M./a/3. (46) 

Here B^'^°'^ is a constant matrix depending on the background information which is encoded 
in e'^ and 

fixv = df^a^ - d^a^ (47) 

is the gauge invariant field strength of a^. Consequently, the quadratic Lagrangian for has 
an accidental U{1) gauge symmetry, and oq is nondynamical (there is no doao). Therefore, 
as in Maxwell theory, Ca describes propagation of two transverse vector modes. 

In any other FP theory, Ca does not contain two -F^j^'s per term and the quadratic 
Lagrangian for fluctuations would generically be of the form 

~ dada , (48) 

without any gauge symmetry. These types of theories are easily seen to contain four degrees 
of freedom, one of them being a ghost (see, e.g., [iO]). 



3.2 Nonsymmetric Background 

When d^A^^ is not symmetric (e^^ 7^ e^^), one can still use the symmetry of FP theories 
(pllTl) under internal Lorentz transformation of 0° fields to symmetrize the background. The 
action depends on 0" only through the combination g^'^do-4>°'dy(f)'^r]a/3 and for any global 
Lorentz transformation A° we have 

g>'''d^{At<P')dM^^r)Va(s = g^^d^<P%rrj,^ . (49) 



On the other hand an arbitrary matrix can be made symmetric (in the sense that 
Vfj-a^l = Vfa^Ji) by a Lorentz transformation on one of the indices 

^ Ke: , (50) 

as long as it is not too far from 5^ (we will be more explicit about this in Sec. 13. 4p . 

Choosing to be this Lorentz transformation, we now have a set of transformed scalar 
fields 0^* = A'^(f)'^. Defining, as before, 

= x'^ - , (51) 
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the background will be 

K = - ^'O^'^ ' (52) 

for which d^A!'^ is symmetric and we are back to the previous case. Therefore, the quadratic 
Lagrangian d"^^ for the redefined field 

a'^ = (53) 

will have a gauge symmetry in dRGT and describes two transverse vector modes, but not in 
any other FP. Before turning to the analysis of mixing with gravity, it is well-timed to make 
a few comments: 



• The symmetrization of the background can be achieved, equally well, by using the 
freedom in choosing a Lorentz transformed local inertial frame. To symmetrize 5^0^^ 
one can either do a rotation (or boost) in field space or an opposite one in coordinate 
space. 

• We mentioned earlier that switching from Latin to Greek indices, per se, does not 
change the nature of the fields. The important steps that packaged fluctuations of the 
four scalar fields (back to Latin indices!) into a vector field were (a) we transformed 
to the local inertial frame in which the space-time metric becomes the same as the 
internal metric rjAB- As a result, the field indices and the space-time indices see the 
same metric, as was the case for FP around Minkowski. 

However, there is still a big difference: now we have another nontrivial matrix 9^0^, 
or better said = rjABdf^cp^. (b) By rotating field space basis relative to coordinate 
basis we symmetrized d^(j)A so that it does not differentiate among space-time and 
field indices anymore. Only then can we interpret Greek indices on a'^ as space-time 
indices and the field itself as a space-time vector. This property is preserved under 
simultaneous Lorentz transformations in field and coordinate space. 

• Now we can see when it is legitimate to do the decomposition 

a^, = Vf, + d^w . (54) 

This is possible after transformation to the locally inertial frame (or more generally 
matching the space-time metric and the reference metric in any other way) and sym- 
metrization of the background, i.e. when becomes a space-time vector. 

• Now consider the original Stiieckelberg field which described the longitudinal mode 
around Minkowski 

A^ = V^ + d^TT , (55) 

and suppose we are given a background solution h'lf^, V^^, and iihg. The question we 
wish to answer is, when does the perturbation Sir around iThg describe the longitudinal 
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mode of graviton around this background? First of all, we see that going to the local 
inertial frame is impossible since = SA^ behave as four scalar fields under general 
coordinate transformation while d^Sn behaves as a vector field. Therefore, h^'Jl must 
be small such that different indices see approximately the same metric. Secondly, the 
background should be symmetric. An important example is when we have an almost 
purely tt background, such as the spherically symmetric asymptotically flat solution in 
the Schwarzschild frame and when r 3> |1T]. In these situations the background is 
trivially symmetric 

d^d^TTbg = dyd^Tibg , (56) 
and diT describes the longitudinal mode of the vector and the graviton. 

• Our analysis showed that unlike generic FP where £a on a generic linearized back- 
ground describes four degrees of freedom, in dRGT Ca describes two. It has an 
accidental U{1) gauge symmetry and oq is nondynamical. Therefore, as in the case of 
FP around Minkowski, the helicity-0 degree of freedom should get its kinetic term via 

(1') 

dynamical mixing with gravity C\^- 



3.3 Mixing with Gravity 

Since we are in the local inertial frame and the curvature effects have been temporarily 
ignored, the quadratic mixing between and h^^, comes entirely from terms in ChA that 
are linear in h^j^^. This makes the analysis very simple because as shown in (140|) these terms 
are constrained in dRGT to be of the form 

3 

ChA = V «nXr('5p./2) + hF{dA + ■■■) + 0{hl,) . (57) 

n=l 

The first term h^^X'^^ {Spc/2) is what we studied in the case of FP around Minkowski (fTSjl . 
We saw that the symmetry of the indices of is such that OqAq is multiplied by spatial 
components of the metric fluctuation hij. Therefore, we could integrate by parts the time 
derivative of SqAq to make Aq a nondynamical field without making the Lagrange multipliers 
of GR, /ioo and /ioi, dynamical. It follows that the longitudinal component of Ai becomes 
dynamical through mixing and we get three new degrees of freedom in addition to the two 
tensor modes of the metric. 

For a nontrivial symmetric background, the analysis of the first term does not change 

f 2") 

at all; we just need to replace A^ with a^. But, another contribution to comes from 
equating, respectively, one and two S^^s in X2{S/2) and X^{S/2) to their background values 
^]iu — '^9fj,Aj'. It again follows from the antisymmetry of the Levi-Civita symbols in the 
definition of X^'^ that the terms with dodQ cannot contain /iqo or hoi- More explicitly, we 
have from these terms 

- > (58) 
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where we have suppressed irrelevant indices and background fields. So, remains nondy- 
namical. 

(2) 

The only remaining part of iZj^j comes from the second type terms in flFTI) . where the 
factor of F^i, vanishes on the symmetric background and must be evaluated on perturbations. 
The contribution of these terms is, therefore, of the form hf in which ao is manifestly 
nondynamical without any need for integration by parts. 

Note finally that the presented proof of ghost freedom around linearized backgrounds 
goes beyond what is already known in the decoupling limit of dRGT. In common between 
the two analyses is that they are applicable only if curvature is negligible so that h^^, can be 
kept linearly. However, the decoupling limit truncation of dRGT is proven to be ghost free 
only around pure tt backgrounds, while in the presence of generic background vector fields 
it is not a consistent truncation of the theory and propagates a sixth degree of freedom. 



3.4 An Example 

To illustrate the power and limitations of our approach, let us study a simple but nontrivial 
example, where the invariant interval in the unitary gauge is given by 

ds^ = g'l^Jx'^dxl = -dtl + 5ij {dxi + 2l'dQ {dxl + 2Vdtu) . (59) 

This background was first considered in |12], and it was claimed that the fluctuations of all 
four scalar fields become dynamical around this background. Later it was shown in [29] that 
via a field redefinition, one of the fields can be made nondynamical and it was argued that the 
pure scalar sector propagates three degrees of freedom based on an analysis of Hamiltonian 
constraints. 

We now analyze this problem by going to the local inertial frame as prescribed in previous 
subsections. Note that the unitary gauge metric (!59|) is flat and can be transformed to 
Minkowski metric in a new coordinate system (t, x*) if we take 

tu = <Pl = t, (60) 
< = 4>l, = x'-2lH. (61) 

This is perfectly linear, but not symmetric: <9j0°g = while dQ(j)\g = —2l\ To symmetrize it, 
we can perform a Lorentz boost with parameter /* 

0° = ^=i=t(0° + W, 0* = ^i==(0^ + />°). (62) 
It can be easily checked that this transformation makes 9^0^^ = f]uad^4'hg symmetric. There- 

~ (2) 

fore, the fluctuations a'^ form a vector fleld. As shown before, the pure scalar sector C- in 
dRGT will depend on only through its fleld strength /^j,, therefore it describes propaga- 
tion of two transverse vectors. The third mode becomes dynamical solely through mixing 
with gravity. 
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Now we can see what it means to be too far from Minkowski to symmetrize the back- 
ground. It amounts to taking ll > 1 in ( 162|) . That is, unhke Euclidean space where any 
vector can be rotated to align with any other, in a space with Lorentzian signature, boosts 
have some limitations and cannot align a timelike vector with a spacelike one. The unitary- 
gauge metrics with a nonzero shift vector need a boost for symmetrization, so those with a 
very large shift vector are expected to be nonsymmetrizable. 

Let us see if the Hamiltonian analysis of Hassan and Rosen [19] can resolve this issue. 
In that approach one wants to redefine the shift vector A^* in terms of a new vector n*, such 
that the action becomes linear in the lapse A^. Since the GR action contains iV* linearly, the 
field redefinition should be linear in N. Consider for simplicity a 2d space-time; then the 
redefinition can be written as 

iyi = (1 + ND)n^ . (63) 

It can be shown using a representation of dRGT [20] that the coefficient D must satisfy 

v/1 - {n^yD = - (Dn^y, (64) 

where 7 is the Id spatial metric. The solution is -D = a/t"^ and using the two dimensional 
analogue of ( l59l) which has 7 = 1 and = 1, we get D = 1. Substitution in ( l63l) gives 
= N'^/2. Thus, quite similarly to the symmetrization fl62|) . when A^^ is so large that 
{n^y > 1, the redefinition (l63l) is ill-defined since the prefactor -^/l — {n^y in flMl) becomes 
complex. It appears that these metrics, if realized as background solutions, evade any proof 
of ghost freedom. 

The resolution, however, is provided by Hassan and Rosen. The equations of motion 
for ra* ensure that, as long as the unitary gauge is well-defined, (n*)^ does not exceed 1 and 
the field redefinition is possible. Therefore, the nonsymmetrizable "backgrounds" which are 
characterized by unitary-gauge metrics with very large values of shift vector are not true 
background solutions of the theory]^ 

We will not further investigate whether all nonsymmetrizable backgrounds fall into this 
category. In what follows we will focus on the continuum of backgrounds which are close 
enough to Minkowski to be symmetrized. 



4 Nonlinear Background 

In this section we generalize the analysis of the last section to nonlinear (or curved) back- 
grounds. The general procedure should have become clear by the intuition we gained from 
linear backgrounds. The field space basis must be rotated point by point so that the field 
and coordinate indices are treated symmetrically. Then the fluctuations a'^ form a vector 
field. 

"'I am thankful to Rachel Rosen for explaining this to me. 
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We break the proof into two steps. First we ignore curvature but allow for nonlinearities in 
0^^. This can be thought of as a locally fiat space but with nonzero gravitational acceleration. 
We next include curvature. 

4.1 Zero Curvature 

Let us try to follow the steps of the last section and see when modifications are needed. As 
before we can coordinate transform to a local inertial frame of an arbitrary point and set 
h% = in its vicinity. The background scalar fields are, in general, nonlinear so that 9^0^^ 
is space-time dependent. If this matrix turns out to be symmetric in a region much larger 
than the de Broglie wavelength of the modes in question then as before we conclude that 
the quadratic Lagrangian for perturbations C^^ acquires an accidental gauge symmetry and 
describes only two vector degrees of freedom. 

If the background is not symmetric, it means that it differentiates among coordinate and 
field indices. We then perform an internal Lorentz transformation of the field space basis 
to align them with the coordinate basis, thereby symmetrizing the background. Generically, 
however, a constant Lorentz transformation is unable to do the job and we need to use the 
larger symmetry of the theory under local Lorentz transformations A(x) 

g-\d(j))ri{d^f = g-\d(t))K{x)r]K^{x){d(t)Y . (65) 

Barring backgrounds that are too far from Minkowski we can find an appropriate A(a;) to 
align the two bases and symmetrize the background in the whole region so that 

[{dct>,g)K{x)r,Y = {dct>,g)K{x)r^ . (66) 

Having symmetrized the background, we can now return to the expansion of the theory in 
terms of A^. The theory depends on 0^ only through first derivatives, ducf)^'-, therefore, when 
we defined = x^ — (j)^ and expanded the action in terms of A^, it could be thought of as 
an expansion in 5^ — dycj)^. Thus, after the above transformation, dfj,A^ must be replaced by 

d^A^ ^5;-K{x)d,r . (67) 

We will see that the x dependence of A|^(x) leads to some modifications in the analysis of 
nonlinear backgrounds which make them fundamentally different from the linear ones. 

4.1.1 Diagonalization of the Perturbations 

First consider the perturbations of the scalar fields. According to fl67|) . we should replace 
dfj,a^ with K'^{x)dija" in Ca and Cha- When A^ was a constant, it could be passed inside the 
derivative and the perturbations a'^ could be diagonalized by defining 

a'^ = A^a- ^ (68) 
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where is a true vector field. In the case of x-dependent A(^(x), the same procedure yields 
an extra piece 

A^(x)a,a" = a,(A^(a;)a") - (9,A^(a;)) a'' = + Cj^X , (69) 

where in the last step we have defined 

d'' = A>:{x)a\ and C,^, = - (a,A^(x)) A^ (70) 

We emphasize that the above space-time dependent field redefinition is the manifestation of 
the point by point alignment of coordinate and field bases. 

The presence of the extra piece in fl69|l makes an important difference: Suppose we want 

(2) 

to derive Ca , the quadratic Lagrangian for the fluctuations in the pure scalar sector. In the 
last section we saw that after symmetrization and field redefinition (1681) . C\ in dRGT only 
contains the antisymmetric field strength /^j, = d^^d^]. That statement really meant that, 
upon lowering by Minkowski metric, the indices /i and u of A'^{x)di,a'^ are antisymmetrized. 
This gives /^^jy for constant A^; however, in general there is an additional term 

Af^ad^a" - Auad^.a" = /^^ + C[^,,]ad'' . (71) 

(2) ~ 

Therefore unlike linear backgrounds where C\ was formed out of two /^i/'s and had an 
accidental gauge symmetry, here we schematically have 

£f ~ // + Cfd + CCdd , (72) 

where the indices are suppressed. This Lagrangian looks a lot like that of a massive gauge 
field in which do is nondynamical, since there is no dodo, and three degrees of freedom 
propagate; two transverse helicity-1, and a longitudinal helicity-0. So, the nonlinearities of 
the background, which are characterized by Cj^^, break the degeneracy of the pure scalar 
sector and give a contribution to the kinetic term of the helicity-0 mode which used to get 
its dynamics through mixing with gravity. 



4.1.2 Nonlntegrability of the Background 

Next we consider the symmetrized background. According to fl67|) we should replace d^A^^^ 
with the following symmetric matrix 

{dA,gX ^ 6i: - A^^{x)d.<f>lg . (73) 

Again, when A^ is constant, we can commute it with the derivative and define a new back- 
ground field A'^g = — A^^iplg whose derivative is fl73|) . However, such a new background 
field does not necessarily exist when AJ^ depends on x; that is {dAbg) is nonintegrable. 

One may wonder whether this causes any technical obstruction in the analysis. Let us 
recall that the technical consequence of symmetrization was to make -F^j^'s in the expansions 
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La and LhA vanish on the background so that they had to be evaluated on perturbations. But 
the same will be true if we replace dp^A^ inside F^j^ = d^pAy\^ with any symmetric matrix, such 
as our {dA\,g)p^\ it still vanishes. So we still need to evaluate all F^J"^ on the perturbations, 
which in the present case are Ky,„[x)dyaF and the result was discussed in section 14.1.11 

The only technical difference caused by the nonintegrability of the background would 
arise when we integrated some derivatives by parts to get the desired form of La with two 
-F^j^'s per term. This is not permitted anymore and one needs to look at the original form of 
La- Fortunately, integration by parts was needed only for a finite number of terms, namely, 
the quadratic, cubic, and quartic terms that upon substitution A^ = d^n would yield total 
derivatives. We will systematically treat these terms in Appendix O and confirm that their 
contribution to L^a^ is still of the form ( 172]) . 

Finally, we should make sure that the structure of mixing terms in L\l does not change 
significantly; to be more precise, we should show that no OqCIq multiplies /iqo and /loi- Since 
^% ~ 0' °^^y need terms in LhA which are linear in h^y [Eq. fHOj) ]. 

3 

Cha = V E «n^r(^P J2) + hF{dA + ■■■) + 0{hl^) . (74) 

n=l 

Note that we never used integration by parts to obtain this; therefore, after symmetrization 
F^y in the second part must be replaced by f^y + C\^^y]„a" because {dAi,g)[^y] = 0. Hence there 
is no c^oSo in this part. One then needs to take one of the dfj^Ay^s in each X„ and replace it 
by a perturbation, which now reads dfj_dy + C^y^jd" . The d^dy part was encountered before; 
when /i = 1/ = it will not multiply /iqo or /loi because of the symmetry properties of X!^'' . 
The Cd term does not carry any derivatives. 

4.2 Curved Background 

Only one nontrivial step is required to generalize the proof to include metric curvature. It 
should have become evident by now that the key ingredient of the proof is alignment of field 
space basis and coordinate basis. So far, an easy step towards this alignment was going to 
the locally inertial frame where the background space-time metric and the field space metric 
TjAB become the same (in this section it will be equally convenient to use Latin indices for 
scalar fields). In a curved space-time, this can be done only at a single point and in its 
vicinity the two metrics start to deviate from each other according to 

= ^/.. + = - \{Rt.aup + + ^(a^a;x) . (75) 

This issue can be overcome by a technically simple but conceptually fundamental twist 
in our approach. Instead of attempting to align the curved space-time metric with the flat 
field space metric, we do the opposite by introducing vielbeins in the field space. Let us call 
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the background metric gf^u 



9,u{x) = g%{x) , (76) 



we can find a set of vielbeins tliat relate tlie original fiat reference metric tjab to g^y as 
follows 

VAB = e''A{x)e^B{x)g^p{x) . (77) 

As usual, one has freedom in the choice of the vielbein and the different choices are related 
by 

e^A ^ ^'uel , (78) 
where are generalized Lorentz transformations which preserve the metric ^^^1 

T^Kg^p = g,, . (79) 

They form a group which is isomorphic to the Lorentz group S0{1, 3). 
Once the substitution (1771) is done in the FP action 

S^p = --^Mlm^ j d'x^V {g^''eP^d„ct)\%d,cl>^-gp^) , (80) 

it can still be expanded as a sum of three pieces 

Sfp = I d^xV^iU + C,A + Ca) , (81) 



'pi' 

where Ch, ^ka-, and La are, respectively, the Lagrangians for the metric, coupling between 
metric and the scalar fields, and pure scalar fields. They are the same as Cka-, and La-, 
except for three modifications: (a) Instead of Minkowski metric, all indices are contracted 
by gp,y and its inverse, (b) The metric perturbation h^^^y is with respect to the new reference 
metric 

Qi^v = (jfiu + h^y . (82) 

And (c) all d^A^s should be replaced with 

dpAy (jf^y - (7^^e^(9^<^^ . (83) 

The background value of fl83|) can be symmetrized by exploiting the freedom fl78|) in the 
choice of e^. The result is a symmetric, but not necessarily integrable, matrix 

{dAhg)^y = g^u - g^^e^^df^^fg . (84) 
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We can now diagonalize the perturbations of the scalar fields a"^ by defining the locally 
aligned perturbations and the connection matrix C^^, such that 

9vae\d^a^ = d^a^ + C^^a^ , (85) 

where 

a, = ^M<xe^a^ , and, C^, = -d,{g,xeMxWe-'^ . (86) 

Now we are back to the situation of the previous subsection except for two minor caveats. 
In the analysis of the quadratic, cubic, and quartic terms in Ca which give F^jy only after 
integration by parts, we should also include the nonfiatness of the metric. However, this 
cannot qualitatively change our previous conclusion based on a fiat metric, because non- 
fiat metric does not change the symmetry of indices and its only effect is to introduce 
an x-dependent coefficient. But x-dependent coefficients were already present before (see 
Appendix [U]). Therefore, C^^^ will again be of the form ( I72p . 

The other caveat is that for a generic gf^,y, the position of indices matters. The dynamical 

(2) 

variables of GR are gij and at the linearized level hij. So we should make sure that in C\J^, 
dodo is multiplied by hij with lower indices. This is evident from the explicit form of ChA, 
where d^dQ appears only in the combination 

41 ~ h,,e'^''-e''^-d^d^ ■ ■ • , (87) 

and we have suppressed irrelevant indices and background matrices. This completes our 
proof of ghost freedom in dRGT massive gravity. 

We conclude this section with two remarks. First, that in our final approach the special 
choice of Minkowski reference metric, tjab, did not play any role. For an arbitrary space-time 
dependent reference metric '~fAB{x) one could define generalized vielbeins 

'jABix) = eA{,x)e^j^{x)g„p , (88) 

and the rest of the proof would proceed unaltered. Therefore, as first shown in [23], dRGT 
with any reference metric propagates five degrees of freedom. 

Secondly, the action f ET]) provides the right framework to consistently generalize the defi- 
nition of decoupling limit to cases where the background metric is different from the reference 
metric of FP. The Stiieckelberg field vr can again be defined to describe the longitudinal mode 
of the true vector field a^, and the hierarchy of interactions becomes evident once different 
helicity modes are canonically normalized. 

5 Dispersion Relations and the Cutoff 

Having established the number of degrees of freedom in dRGT to be five, one can come up 
with a more practical method of deriving dispersion relations for helicity-1 and helicity-0 
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modes. We saw that around linearized backgrounds the pure scalar sector is nondegenerate 
for transverse vector (helicity-1) modes. It is also easy to see that mixing with gravity has 
a negligible effect on them at large momenta p S> m. The relevant quadratic Lagrangian for 
these modes looks like 

£(2)~M^X(// + /i9a), (89) 

where /i^j, and are not canonically normalized. Normalizing them by taking = Mpima^ 
and /i^j, = Mpihf^i^, and assuming that canonically normalized variables are of the same order 
of the energy scale of interest confirms the above statement. 

Therefore, investigating the behavior of transverse vector modes is relatively easy. One 
can first linearize any background (since the nonlinearities introduce subleading corrections), 
then align the field and coordinate bases (for instance, by going to the local inertial frame 
and symmetrizing), and finally freeze the metric and just study the transverse excitations of 
the pure scalar theory such as ay{t,x) (perhaps using the resummed version of dRGT [5]). 

Some of the above steps may often be unnecessary. For instance, whenever symmetriza- 
tion of the background does not affect the mode in question, say when we study ay and 
the symmetrizing Lorentz transformation is a boost in the t — x plane, we do not even 
need to symmetrize the background. Nevertheless, symmetrization usually simplifies the 
computation. See [311 [32] for an example. 

We should remark that the coefficients and the matrices that contract indices in (l89l) 
depend on the background solution. Typically they are of the order of 9^01^^ which is 6'^^ 
on Minkowski background but can deviate significantly for different solutions (for instance, 
through Vainshtein mechanism [8]). Examples in which the vector modes become ghostlike 
or infinitely strongly coupled can be found in [33l [13] . 



5.1 The Scalar Mode 

Finding the dispersion relation of the scalar mode needs more effort. On linear backgrounds 
the pure scalar sector of dRGT is degenerate and nonlinearities are important. We saw that 
these nonlinearities generate mass for the properly aligned vector field a^, which provides 
dynamics for the longitudinal mode. However, so far it seemed impractical to find the explicit 
form of this term. In this section we introduce a method to find the leading mass term. 

As already mentioned, since now is a vector field one can safely define the Stiieckelberg 
field 71 

a^, = v^ + df,7i , (90) 

which captures the dynamics of the longitudinal mode at high momenta. The contribution 
of mixing with gravity to the dispersion relation of tt was discussed in Sees. 12.41 and 13.31 
The recipe to find the contribution of nonlinearities around a given point P is 
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1. Coordinate transform to a local inertial frame where P is located at = 0. The 
metric will look like 

9% = + = V>.u - + R^.(S.a)x''x^ + 0{xxx) . (91) 

2. Taylor expand the scalar field background (p'^g and symmetrize the linear part to obtain 

€g = - Ic^px'^x^ + 0{xxx) , (92) 

where after lowering the indices by Minkowski metric e^^ is symmetric. We denote the 
fluctuations of (j)^ by —a^. Note that is different from a fully aligned vector field 
only by terms that disappear at x = 0. We exploit this fact to obtain the leading 
quadratic Lagrangian C^^^ from the more accessible one C^a'^ . By leading we mean the 
part that survives x — ?■ limit. 

3. Neglecting h^Jl for the moment, one can expand the action to second order in 
around the background (p2l) . For this purpose the resummed version of the theory can 
be useful. It will become clear that one can keep only the linear and quadratic terms 
in (p2|) . so the result schematically looks like 

C^^^ ~ dada + Cxdada + CCxxdada , (93) 

plus terms which are higher order in x and therefore subleading. Since /ij^^ was ne- 
glected, what we have obtained is merely the contribution of the pure scalar sector Ca 
to C^^l 

4. Since no x-dependent symmetrization has been performed, we can use the fact that 
Ca contains two F^^^s per term up to total derivatives. F^j, on the linear part of the 
background which is 9^y4|^™'''^ = ?7^j, — e^^, vanishes since e^,^ is symmetrized. However, 
the nonlinear part is not necessarily symmetric and we have 

= -ClH^^'' ■ (94) 

So in the first term in fl93|) which does not have any C, both -F^^'s are evaluated on 
perturbations and therefore it can be written as // up to total derivatives. In the 
second term with one C only one could possibly be evaluated on the background 
so it can be written as Cxfda. And there is no restriction on the last term. 

5. Let us recall the role of the full nonlinear symmetrization. It was, essentially, an x- 
dependent field redefinition from to a^, so as to cancel the above non-gauge invariant 
terms of the forms Cxfda and CCxxdada. The field redefinition is an identity at x = 0; 
therefore, the first term of fl93|) . which can be written as //, remains the same in 
However, the x dependence of the field redefinition has the byproduct of generating 
Cfa and CCaa. These two terms, upon introduction of the longitudinal mode fl90l) 
and diagonalization, give 

~ CCdndTT . (95) 
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(2") (2) 

Yet there is a shortcut to obtain the leading part of and therefore Or from the 
quadratic Lagrangian C^a^ fl93l) without having to find the exphcit field redefinition to 
a'^. We already saw that the leading // terms in £i^^and match each other. To 
find the rest, derive the equations of motion for and take its divergence 

5a,. 



d,^ = 0; (96) 



this automatically projects out the gauge invariant part which describes the transverse 
modes. The result looks like 

Cdf + CCda + 0{x) = . (97) 

The first two terms are exactly what would have been obtained from the Cfd and 
CCaa terms in . The 0{x) terms, which vanish at x = 0, typically describe a 
ghost that emerges away from x = 0. This is an artifact of the incomplete alignment 
of field and coordinate bases. Neglecting them, from (1971) one can reconstruct Ca and 
obtain C^^. 

7. To see why higher order terms in (p2|) are negligible, suppose we included the cubic 
correction 6(j)'^g = D^pyX^'x^x^. Up to second order in x it would contribute to d"a^ by 
at most one insertion of nonlinearities 

Dxxdada . (98) 

However, as argued after ( IMl) these terms with a single insertion of nonlinearities can 
be recast in the form Dxxfda, which in terms of d corresponds to Dxfd terms. This 
is evidently subleading compared to previously considered terms since, after diagonal- 
ization, it yields the kinetic term DDxxdirdTT for the longitudinal mode. 

8. Finally, curvature effects are encoded in /i^^^ and therefore contribute through C^a- As 
before we extract terms that survive x — )■ limit. It turns out that only two pieces in 
ChA are relevant, namely, 

h%X^\ and h%X!;\ (99) 

where one in X!^'^ is put on the linearized background. The other terms in ChA fall 
into two categories: (a) terms that contain at least one F^jjy and (b) terms with higher 
powers of hfj_^. In the first case, after symmetrization of the linear part of 
in these terms can be evaluated either on the nonlinear part of the background which 
gives extra suppression in x or should be evaluated on the perturbations to give fda- 
type terms multiplied by ~ 0{xx), which are subleading. Similarly, the type (b) 
terms are suppressed by extra powers of x after evaluating all /i^j, on the background 



r (2) 

^The contribution of curvature (i?) to dynamical mixing with gravity is of the form C)^^ ~ Rxxhddn and 

(2) 

the only way it may survive x — > limit is to give C)^^ ^ Rhir. which is negligible at large momenta. 
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9. As explicitly derived in Appendix [Cl one can use integration by parts and the symmetry 
properties of X^'^ to transform (p9|) into the form 

£(2) _ Ri^aa + xaf + xxff) , (100) 

where R stands for Riemann tensor. The leading contribution comes from the first 
term Raa which using (Mil) gives 

~ Rdndn . (101) 



5.2 Summary 

There are, therefore, three types of contributions to the kinetic term of the scalar mode: 
mixing with gravity, nonlinearities of scalar fields C, and curvature R. After restoring mass 
scales the final result schematically reads 

~ Mlm^d-Kd-K + Mlm'^CCd'Kd'K + M^im"^ RdndTr . (102) 

While around Minkowski or other approximately linear backgrounds mixing with gravity 
(the first term) plays the main role, once the scale of nonlinearities or curvature becomes 
comparable to the graviton mass the latter contributions start to dominate. 

Therefore, given the strict upper bounds on the value of the graviton mass, even modest 
deviations from flat space can drastically change the behavior of the scalar mode. For 
instance, for some choices of parameters of dRGT, the scalar mode becomes a ghost in the 
self-accelerated solution [33] (of course, this is not the Boulware-Deser ghost since there are 
still five degrees of freedom). 

Finally, denoting the length scale of nonlinearities or curvature by /, f ll02p implies that 
the new cutoff of the theory around those backgrounds is given by 

A,^(^)'", (103) 

which can be much larger than A3 = {Mpim?Y/^ . 
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A Total Derivatives and dRGT 



It was shown in [37] that in d dimension for each n < d there is a unique nth order total 
derivative scalar made out of H^j^y = d^d^n. Using the Levi-Civita symbol, this scalar 
combination can be nicely represented as [33] 

Since is a total derivative, it can also be written as 

4:i(n) = 9,(9.vrxri(n)) , (105) 

where 

1 (9/:^"^ 

and by definition it must be conserved and symmetric {d^X^'^^ = 0). 
Our goal here is to show that the Lagrangian 

( q /9^ _ "'" c-"l---"d-nMl-A'nc- !^l---i^n C ...Q (^^\7\ 

which gives L^"),(n) upon substitution = d^-n, contains two antisymmetric F^^, up to total 
derivative terms. Recall that showing the existence of only one F^^ is sufficient, since its 
antisymmetry guarantees that they come in pairs. 

Let us write S^y as 

S,,y = 2d^Ay - F^, , (108) 
and substitute it in fll07p . The term that contains no F^^ will be 

on Qi---Otj_„/ii---/i„ lyi-'-Unfi A . . . pi A 

= T^d^, ■ ■ ■ , (109) 

a total derivative. So we must pick at least one F^jy. To see explicitly how it will be paired, 
rewrite all the rest of ^^J^Ay•$, back in terms of 5^1, and F^^^y. Then observe that the product 
of two Levi-Civita symbols is symmetric under the exchange of all {/ij} with all {z/j}, and 
therefore only terms with an even number of F^y-s^ survive. 



B Coupling to Gravity 



Consider a generally covariant action 

1 
4 



S = --Mlrr? I d'x^ V{H:) , (110) 
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where V is a. function of trace of different products of = g'^^H^y, and 

H^, = g^,-d^rd,r'nap. (Ill) 

Our objective is to find the leading couphng between h^j^^ = g^y — rj^^ and the scalar 
Stiieckelberg tt, when V is chosen to be dRGT. That is, when pure tt interactions are zero 
or total derivatives. As a reminder, tt and are defined as follows 

(j^i^ = _ j^i^ ^ (112) 

A^ = r]^,A'' = V^ + d^Tr. (113) 



The idea is to use the general covariance of the original theory formulated in terms of 
four scalar fields to establish a relation between different interaction terms in the action. 
Since 0^ are scalars, under an infinitesimal coordinate transformation (or diffeomorphism) 

^'fj. — _|_ -^g have 

(P^^ = x^^_A'' -> x'^ - A^(x') = x^" - (A'^(x') - ■ (114) 

This shift of A'^ can be attributed to the vector component so that vr would transform 
as a scalar 

V,{x) ^ V,{x')-^,, (115) 

7i{x) n{x'). (116) 

In addition, the metric perturbation /i^j, gets shifted under the above coordinate transfor- 
mation 

hpu{x) V(^') + 9{t^^ ■ (117) 



So, although the original action was trivially diffeomorphism invariant, since we are 
expanding it around a noninvariant background {g^^, = rj^y and = x'^), the symmetry is 
nonlinearly realized by and /i^j,. Therefore, the effective action for h^^, V^, and tt, which 
is intrinsically diffeomorphism invariant, fulfills this through a cancellation of the following 
transformations: 

-> h^,^ + d(^,^, (118) 
V, ^ V,-i,, (119) 
^ S^ + S^r-^., (120) 

and their higher order corrections which we do not need here. As always, there is also a 
change of variables of integration from x^ to x'^. 

Let us apply this to an interaction which is linear in h^y and of nth order in Il^y, which 
we write as hfj,y{W'Y^ . Under (11181) it changes by 

9(^6)(nT'^ + o(V)- (121) 
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This change can only be compensated by the change of the following terms in the effective 
Lagrangian: 

di^,y,){WY' and (n"), (122) 

respectively under the transformations flllQp and (11201) . However, both of these terms are 
absent in dRGT up to total derivatives. Therefore, (I12ip should also vanish up to a total 
derivative, or 

9^(1^^^ = ^, (123) 

This equation should be familiar from Appendix [Xj given the conserved (11")'^'^ one can 
construct an + 1st order total derivative 

Qtr = {dM^nn ■ (124) 

Therefore, we can identify (11"')^'^ with X^*^ defined in (I106P and conclude that the interac- 
tions of one hfj^i, and tt in 4d dRGT are of the form 

3 

Ch. = h^u 5^ a„Xr(n) + 0{h%) . (125) 

n=l 



C Quadratic Effective Action 

To calculate the effective action for perturbations around a nonlinear background, one often 
needs to replace d^A^^ with a symmetric but not necessarily integrable matrix [dA^^)^^. 
Moreover the perturbations S^aj, should be replaced by d^di,+C'^^,da- The resulting quadratic 
Lagrangian would in general be 

£(2) ^ 5M-p-(9^5^ + C^,d^){d,d„ + C^^^dp) , (126) 
where B'^^^'^ depends on the background fields. 

(2) ~~ ~ 

Our aim is to show that in dRGT, C- consist only of //, /a, and da. When the Cd 
term is chosen from both parentheses in fll26p . this is automatic. When one dd and one Cd 
are chosen, we have 

Cfl, = {BCr ^d.d^d,. (127) 

This may not generically look like fd but can be transformed into that by integration by 
parts. Writing BC in terms of symmetric and antisymmetric parts 

{BCY^P = - [{BCy^"'^P + {BCY^'^'^P] (128) 
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and denoting equality up to total derivatives by =, we have 



a — 

aaa 




(129) 




= ^(5C)[Hp/^^a, - ^d,{BCY^''K,a, - 


hBCY'^^^Pa,d,~ap 
2 




- -(BO^^'-'^'^f a - -d (BC)'^^"'^Pa a - 


-(BO'^^'^'^'^a if +d a ) 










= \{BCt-^Pf^,~ap - U^{BC)^^-^>'~a,~ap - 


^{BCY'^-'^P-aJ^p + ^d,{BCY^''^f~a,~a, 


which 


is of the desired form. 




So 


we only need to check dada terms. When the indices of df^ai, are antisymmetrized, 



as is the case for the contribution of all higher than quartic terms in Ca, we also get the 
expected form //. It remains to check the three £^^^^{3^,^/2) terms. On the symmetric 
background [S^Jl = 2((ii''^)^^], they give 

p(n,2) _ n{n — 1) "^-"^ I rl A^3\ . . . ( rl A^9\ 5 5 (A'MW 

After writing s^^ = 2d^ay — f^^, three types of terms arise: dfj^^a^^d^^ay^, in which the 
derivative can be integrated by parts to act on the background-dependent coefficient. 
It therefore becomes a^^d ^^0,^21 which is harmless according to (11291) . The mixed term 
•9^1 Oj/i 7^21^2 gives ffj.iuif (121^2/'^ because of symmetry reasons. And the last possibility is to 
choose both f^^s, which is automatically of the desired form. 

The introduction of nonflat background metric Qfj^^ does not change the above conclu- 
sion since it does not change the symmetry of indices and can be viewed as a part of the 
background-dependent coefficients. 

Finally, consider the case encountered in Sec. 15. H where we want to find the leading 
quadratic Lagrangian without fully aligning field and coordinate bases. In this case there is 
no x-dependent field redefinition and the curvature effects are encoded in /ij^^. One should 
therefore study hij_yX!^"[S^y/2) terms in ChA, when h^y and all S^^s in X!^^ except two are 
evaluated on the background 

(2) _ ^(^ ~ 1) ibg ^ai-aa_„p.p.i-p.ri-lr- uvi-v„-i nbg , , cbg 

This would look very similar to (I130p if we replaced /ij^^ by S*^^. Nevertheless the argument 
following (llSOp used only the symmetry of the background fields and, since /ijf^ has the same 
symmetry as 5*^^, it applies to this case as well. Thus (11311) is reducible to //, /a, and aa 
terms. In the case of interest, where ~ Rxx, one can ignore nonlinearities in A'^^ and 
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take 5"^^ to be constant. fll3ip can then be manipulated to give 

/'{2) _ n(n — 1) (;n...Q,^_^^^^...^^_-^ uui - u„-i qbg qbg 



X 



^4a^2 '2!^2 ^Ml ^'^l ~l~ 3/;yiiy2'2/i2'^Atl "I" 2/^2i^2'^i^l^Ml 2 '^'^1 /^'^ /^^l 1^2 ^ ^/J^ • (''-32) 
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